STADS: Software Testing as Species Discovery
نویسنده
چکیده
A fundamental challenge of software testing is the statistically well-grounded extrapolation from program behaviors observed during testing. For instance, a security researcher who has run the fuzzer for a week has currently no means (i) to estimate the total number of feasible program branches, given that only a fraction has been covered so far, (ii) to estimate the additional time required to cover 10% more branches (or to estimate the coverage achieved in one more day, resp.), or (iii) to assess the residual risk that a vulnerability exists when no vulnerability has been discovered. Failing to discover a vulnerability, does not mean that none exists—even if the fuzzer was run for a week (or a year). Hence, testing provides no formal correctness guarantees. In this article, I establish an unexpected connection with the otherwise unrelated scientific field of ecology, and introduce a statistical framework that models Software Testing and Analysis as Discovery of Species (STADS). For instance, in order to study the species diversity of arthropods in a tropical rain forest, ecologists would first sample a large number of individuals from that forest, determine their species, and extrapolate from the properties observed in the sample to properties of the whole forest. The estimation (i) of the total number of species, (ii) of the additional sampling effort required to discover 10% more species, or (iii) of the probability to discover a new species are classical problems in ecology. The STADS framework draws from over three decades of research in ecological biostatistics to address the fundamental extrapolation challenge for automated test generation. Our preliminary empirical study demonstrates a good estimator performance even for a fuzzer with adaptive sampling bias—AFL, a state-of-the-art vulnerability detection tool. The STADS framework provides statistical correctness guarantees with quantifiable accuracy.
منابع مشابه
The False Discovery Rate in Simultaneous Fisher and Adjusted Permutation Hypothesis Testing on Microarray Data
Background and Objectives: In recent years, new technologies have led to produce a large amount of data and in the field of biology, microarray technology has also dramatically developed. Meanwhile, the Fisher test is used to compare the control group with two or more experimental groups and also to detect the differentially expressed genes. In this study, the false discovery rate was investiga...
متن کاملComparative transcriptome analysis of isogenic cell line models and primary cancers links capicua (CIC) loss to activation of the MAPK signalling cascade
CIC encodes a transcriptional repressor, capicua (CIC), whose disrupted activity appears to be involved in several cancer types, including type I low-grade gliomas (LGGs) and stomach adenocarcinomas (STADs). To explore human CIC's transcriptional network in an isogenic background, we developed novel isogenic CIC knockout cell lines as model systems, and used these in transcriptome analyses to s...
متن کاملVERDICTS : Visual Exploratory Requirements Discovery and Injection for Comprehension and Testing of Software
We introduce a methodology and research tools for visual exploratory software analysis. VERDICTS combines exploratory testing, tracing, visualization, dynamic discovery and injection of requirements speci cations into a live quick-feedback cycle, without recompilation or restart of the system under test. This supports discovery and veri cation of software dynamic behavior, software comprehensio...
متن کاملThe Chronobiology of Stanford Type A Aortic Dissections: A Comparison of Northern versus Southern Hemispheres.
BACKGROUND Seasonal variations of Stanford Type A dissections (STADs) have been previously described in the Northern Hemisphere (NH). This study sought to determine if these variation are mirrored in the Southern Hemisphere (SH). METHODS Data from patients treated surgically for STADs were retrospectively obtained from existing administrative and clinical databases from NH and SH sites. Data ...
متن کامل